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CN ; The NP-complete Permutation Pattern Matching problem asks 

whether a permutation P can be matched into a permutation T. A matching 
is an order-preserving embedding of P into T. We present a fixed-parameter 
Q \ algorithm solving this problem with an exponential worst-case runtime of 

c/2 ' 0*(1.79 mn ( T )), where run(T) denotes the number of alternating runs of T. 

This is the first algorithm that improves upon the 0*{2 n ) runtime required 
by brute-force search without imposing restrictions on P and T. Furthermore 
we prove that - under standard complexity theoretic assumptions - such a 
fixed-parameter tractability result is not possible for run(P). 

(N 

^ ■ 1. Introduction 

O 



The concept of pattern avoidance (and, closely related, pattern matching) in permuta- 
tions arose in the late 1960ies. It was in an exercise of his Fundamental algorithms [15] 
that Knuth asked which permutations could be sorted using a single stack. The answer 
is simple: These are exactly the permutations avoiding the pattern 231 and they are 
counted by the Catalan numbers. By avoiding (resp. containing) a certain pattern the 
following is meant: The permutation tt = 53142 (written in one-line representation) 
contains the pattern 231, since the subsequence 342 of tt is order- isomorphic to 231. We 
call the subsequence 342 a matching of 231 into n. On the other hand, ir avoids the 
pattern 123 since it contains no increasing subsequence of length three. Since 1985, when 
the first systematic study of Restricted Permutations [21] was published by Simion and 
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Schmidt, the area of pattern avoidance in permutations has become a rapidly growing 
field of discrete mathematics, more specifically of (enumerative) combinatorics [5"lll4j. 

This paper takes the viewpoint of computational complexity. Computational aspects 
of pattern avoidance, in particular the analysis of the Permutation Pattern Match- 
ing (PPM) problem, have received far less attention than enumerative questions until 
now. The PPM problem is defined as follows: 

Permutation Pattern Matching (PPM) 

Instance: A permutation T (the text) of length n and a permutation P 

(the pattern) of length k < n. 
Question: Is there a matching of P into T? 



In [6] it was shown that PPM is in general NP-complete. From this result follows a 
trivial brute-force algorithm checking every length k subsequence of T. Its runtime is 
in C*(2 n ), i.e. is bounded by 2 n • poly{n). To the best of our knowledge, no algorithm 
with a runtime of C*((2 — e) n ) without restrictions on P and T is known yet. If such 
restrictions are imposed, improvements have been achieved. There are polynomial time 
algorithms in case of a separable pattern [21 [61 [131 [19]. Separable permutations avoid 
both 3142 and 2413. In case P is the identity 12 ... k, PPM consists of looking for an 
increasing subsequence of length k in the text - this is a special case of the Longest 
Increasing Subsequence problem. This problem can be solved in 0(nlogn)-time for 
sequences in general [20] and in C(nloglogn)-time for permutations [9lll7j . PPM can be 
solved in C(nlogn)-time for all patterns of length four [2]. An 0(k 2 n e )-time algorithm 
is presented in |12] for the case that both the text and the pattern are 321-avoiding. 

In this paper we tackle the problem of solving PPM faster than G*(2 n ) for arbitrary P 
and T. We achieve this by exploiting the decomposition of permutations into alternating 
runs. As an example, the permutation ir = 53142 has three alternating runs: 531 (down), 
4 (up) and 2 (down). We denote this number of ups and downs in a permutation ir 
by run(-7r). Alternating runs are a fundamental permutation statistic and had been 
studied already in the late 19th century by Andre [I]. An important result was the 
characterization of the distribution of run(-7r) in a random permutation: asymptotically, 
run(-7r) is normal with mean |(2|7r| — 1) [16]. Despite the importance of alternating runs 
within the study of permutations, the connection to PPM has so far not been explored. 

Contributions. In detail the contributions of this paper are the following: 

• We present a fixed-parameter algorithm for PPM with an exponential runtime 
of C*(1.79 mn ( T )). Since the combinatorial explosion is confined to run(T), this 
algorithm performs especially well when T has few alternating runs. Indeed, the 
runtime depends only polynomially on n, the length of T. 

• Since run(T) < n, this algorithm also solves PPM in time 0*(1.79 n ). This is a 
major improvement over the brute-force algorithm. 
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• Furthermore, we analyze this algorithm with respect to run(P). We obtain a run- 
time of 0*( (n 2 /2run(P)) mn ^ ). In the framework of parameterized complexity 
theory this runtime proves XP membership for run(P). 

• We also prove that an algorithm presented in [lj has a runtime of 0(n 1+ru "^). 
This is achieved by proving an inequation that bounds the pathwidth of a cer- 
tain graph generated by a permutation by the number of alternating runs of this 
permutations. 

• Finally, we prove that these XP results cannot be substantially improved. We 
prove that - under standard complexity theoretic assumptions - no fixed-parameter 
algorithm exists with respect to run(P), i.e. no algorithm with runtime 0*(c run ^) 
for some constant c may be hoped for. 

Related work. There is further related work to be mentioned. In [lj an algorithm for 
PPM with a runtime of 0( n °- 47fc +°( fc )) i s presented. 

The Longest Common Pattern problem is to find a longest common pattern be- 
tween two permutations T\ and T2, i.e. a pattern P of maximal length that can be 
matched both into T\ and Ti- This problem is a generalization of PPM since deter- 
mining whether the longest common pattern between T\ and T2 is T\ is equivalent to 
PPM. In [7] a polynomial time algorithm for the Longest Common Pattern problem 
is presented for the case that one of the two permutations T\ and T2 is separable. A 
generalization of this problem, the so called Longest Common C-Pattern problem 
was introduced in [8]. This problem consists of finding the longest common pattern 
among several permutations belonging to a class C of permutations. For the case that C 
is the class of all separable permutations and that the number of input permutations is 
fixed, the problem was shown to be polynomial time solvable. 

For a class of permutations X the Longest X-Subsequence (LXS) problem is 
to identify in a given permutation T its longest subsequence that is isomorphic to a 
permutation of X. In |3j polynomial time algorithms for many classes X are described, 
in general however LXS is NP-hard. 

Organization of the paper. Section [2] contains essential definitions for permutations 
and parameterized complexity theory. The main section, Section O describes the algo- 
rithm and proves runtime bounds and correctness. The hardness results can be found 
in Section HI We conclude with possible future research directions in Section [5] and 
acknowledgments in Section [6l 

2. Preliminaries 
2.1. Permutations 

For any m G N, let [m] denote the set {1, . . . , m} and [0,m] denote {0, 1, . . . ,m}. A 
permutation it on the set [m] can be seen as the sequence vr(l), vr(2), . . . , 7r(m). Viewing 
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permutations as sequences allows us to speak of subsequences of a permutation. We 
speak of a contiguous subsequence of ir if the sequence consists of contiguous elements 
in 7r. 

Definition 2.1. Let P (the pattern) be a permutation of length k. We say that the 
permutation T (the text) of length n contains P as a pattern or that P can be matched 
into T if we can find a subsequence ofT that is order-isomorphic to P. If there is no such 
subsequence we say that T avoids the pattern P. Matching P into T thus consists in 
finding a monotonically increasing map cp : [k] — >■ [n] so that the sequence <p(P), defined 
as ((^(P(i))) ig j fc p is a subsequence of T . 

Every permutation tt on [m] defines a total order -< n on [m]. We write i -< n j iff 
7r _1 (i) < 7T _1 (j), i.e. the value i stands to the left of the value j in 7r. When considering 
the minimum (maximum) of a subset S C [m] with respect to -K-^, we write min^iS 
(max,,- S). 

We discern two types of local extrema in permutations: valleys and peaks. A valley 
of a permutation tt is an element 7r(i) for which it holds that n(i — 1) > 7r(i) and 
7r(i) < n(i + 1). If 7r(i — 1) or 7r(i + 1) is not defined, we still speak of valleys. The set 
Val(n) contains all valleys of it. Similarly, a peak denotes an element 7r(i) for which it 
holds that n(i — 1) < ir(i) and ir(i) > n(i + 1). 

Valleys and peaks partition a permutation into contiguous monotone subsequences, 
so-called (alternating) runs. The first run of a given permutation starts with its first 
element (which is also the first local extremum) and ends with the second local ex- 
tremum. The second run starts with the following element and ends with the third 
local extremum. Continuing in this way, every element of the permutation belongs to 
exactly one alternating run. Observe that every alternating run is either increasing or 
decreasing. We therefore distinguish between runs up and runs down. Note that runs up 
always end with peaks and runs down always end with valleys. The parameter run(7r) 
counts the number of alternating runs in ir. Hence run(7r) + 1 equals the number of 
local extrema in tt. These definitions can be analogously extended to subsequences of 
permutations. 

Example 2.2. In the permutation 181247116329510 the valleys are 1, 4, 2 and 5 
and the peaks are 12, 11, 9 and 10. A decomposition into alternating runs is given 
by: 1812|4|711|632|9|5|10. A graphical representation can be found in Figure [T] on 
page [3 H 

2.2. Parameterized complexity theory 

We give the relevant definitions of parameterized complexity theory. In contrast to 
classical complexity theory, a parameterized complexity analysis studies the runtime of 
an algorithm with respect to an additional parameter and not just the input size \I\. 
Therefore every parameterized problem is considered as a subset of S* x N, where £ is 
the input alphabet. An instance of a parameterized problem consequently consists of an 
input string together with a positive integer p, the parameter. 



4 



Definition 2.3. A parameterized problem P is fixed-parameter tractable (or in FPT) if 

there is a computable function f and an integer c such that there is an algorithm solving 
P in time G(f(p) ■ \I\ C ). 

The algorithm itself is also called fixed-parameter tractable. In this paper we want 
to focus on the exponential runtime of algorithms, i.e. the function /, and therefore use 
the O* notation which neglects polynomial factors. 

A central concept in parameterized complexity theory are fixed-parameter tractable 
reductions. 

Definition 2.4. Let L%,L2 Q S* X N be two parameterized problems. An fpt-reduction 
from L\ to L2 is a mapping R : S* x N — > S* x N such that 

• (I,fc)€Li iffR(I,k)€L 2 . 

• R is computable by an fpt- algorithm. 

• There is a computable function g such that for R(I, k) = (J', k'), k! < g(k) holds. 

FPT is the parameterized equivalent of PTIME. Other important complexity classes in 
the framework of parameterized complexity are W[l] C W[2] C . . the W-hierarchy. For 
our purpose, only the class W[l] is relevant. It is conjectured (and widely believed) that 
W[l] ^ FPT. Therefore showing W[l]-hardness can be considered as evidence that the 
problem is not fixed-parameter tractable. 

Definition 2.5. The class W[l] is defined as the class of all problems that are fpt- 
reducible to the following problem. 



Turing Machine Acceptance 


Instance: 


A nondeterministic Turing machine with its transition table, 




an input word x and a positive integer k. 


Parameter: 


k 


Question: 


Does the Turing machine accept the input x in at most k 




steps? 



Definition 2.6. A parameterized problem is in XP if it can be solved in time 0{\I\^ k ^) 
where f is a computable function. 



All the aforementioned classes are closed under fpt-reductions. The following relations 
between these complexity classes are known: 

FPT C W[l] C W[2] C...CXP 

Further details can be found e.g. in the monographs ^0] [Tl ] H8]. 
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3. The alternating run algorithm 



We start with an outline of the alternating run algorithm. Its description consists of 
two parts. In Part 1 we introduce so-called matching functions. These functions map 
runs in P to sequences of adjacent runs in T. The intention behind matching functions 
is to restrict the search space to certain length k subsequences, namely to those where 
all elements in a run in P are mapped to elements in the corresponding sequences of 
runs in T. In Part 2 a dynamic programming algorithm is described. It checks for every 
matching function whether it is possible to find a compatible matching. This is done by 
finding a small set of representative elements to which the element 1 can be mapped to, 
then - for a given choice for 1 - finding representative values for 2, and so on. 

Theorem 3.1. The alternating run algorithm solves PPM in time C*(1.79 run ^). There- 
fore PPM parameterized by run(T) is in FPT. 

Corollary 3.2. The alternating run algorithm solves PPM in time 0*(1.79 n ) where n 
is the length of the text T . 

Throughout this section we will use the input instance (T ex ,P ex ) which is given by 
T ex = 181247116329510 and P ex = 2314 as a running example. 

Part 1: Matching functions. We introduce the concept of matching functions. These 
are functions from [run(P)], i.e. runs in P, to sequences of adjacent runs in T. For a given 
matching function F the search space in T is restricted to matchings where an element 
i contained in the j-th. run in P is matched to an element in F(j). Two adjacent runs 
in P are mapped to sequences of runs that overlap with exactly one run. This overlap is 
necessary since elements in different runs in P may be matched to elements in the same 
run in T. More precisely, valleys and peaks in P might be matched to the same run in 
T as their successors (see the following example). 

Example 3.3. In Figure Q] P ex (left-hand side) and T ex (right-hand side) are depicted 
together with a matching function F. A matching compatible with F is given by 4 6 2 9. 
We can see that the elements 6 and 2 lie in the same run in T ex even though 3 (a peak) 
and 1 (its successor) lie in different runs in P ex . H 

Definition 3.4. A matching function F maps an element of [run(P)] to a subsequence 
of T . It has to satisfy the following properties for all i 6 [run(P)]. 

(PI) F{i) is a contiguous subsequence ofT. 

(P2) If the i-th run in P is a run up (down), F(i) starts with an element following a 
valley (peak) or the first element in T and ends with a valley (peak) or the last 
element in T. 

(PS) P(l) starts with the first and P(run(P)) ends with the last element in T. 

(P4) F{i) and F(i + 1) have one run in common: F(i + 1) starts with the leftmost 
element in the last run in F(i). 
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=F(2)=F(3) 



Figure 1 : P ex and T ex together with a matching function F and the compatible matching 
4629 



Property (P2) implies that every run up is matched into an M-shaped sequence of runs 
of the form up-down- up-... -up-down (if the run up is the first or the last run in P the 
sequence might start or end differently) and every run down is matched into a W-shaped 
sequence of runs of the form down- up-down-... -down-up (again, if the run down is the 
first or the last run in P, the sequence might start or end differently). These M- and 
W-shaped sequences and their overlap are sketched in Figure [2j 



run up in P 

run down in P 



Figure 2: A sketch of a matching function and its M- and W-shaped subsequences 



The properties (P1)-(P4) guarantee that the number of functions we have to consider 
depends only on run(T) and thus allow us to iterate over all matching functions in fpt 
time. 

Lemma 3.5. There are at most \/2 run ^ T functions from [runP] to subsequences of T 
that satisfy (PI) to (P4). 

Proof. A matching function F can be uniquely characterized by fixing the position of 
the first run up in every F{i) for i S [run(P)]. This is because the last run of F(i) is the 
first run of F(i + 1) for all i E [run(P) — 1]. Moreover the first run up in F(l) is always 
the first run up in T. Thus the number of matching functions is equal to the number of 
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possibilities of picking run(P) — 1 (for the first run in P no choice has to be made) runs 
among the at most |~run(T)/2] runs up in T. Hence there are at most 

[run(T)/2]\ rrun(T)/2l -i /9 run ( T ) 
run(P)-iy " 

such functions. The first inequality holds since (?) < 2 n ~ 1 for all n, k E N as can easily 
be proven by induction over n. □ 



Part 2: Finding a matching. When checking whether T contains P as a pattern, it is 
sufficient to test for all matching functions whether there exists a compatible matching. A 
matching is compatible with a matching function F if an element i contained in the j-th 
run in P is matched to an element in F(J). This is checked by a dynamic programming 
algorithm. The algorithm computes the data structure X K for each k E [k]. X K is a 
subset of [0,n] run ^ and contains representative choices for the matching of the largest 
element in each run in P that is < k. (X k does of course depend on F but we omit this 
in the notation.) 

Let us explain what is meant by representative choices. We search for a compatible 
matching of P into T by successively determining possible elements for 1,2, ... ,k. Given 
a choice for k E [k], possible choices for n + 1 are necessarily larger. In addition, it is 
always preferable to choose elements that are as small as possible. To be more precise: 
if v € [n] has been chosen for n E [k], we merely need to consider the valleys of the 
subsequence of T containing all elements larger than v. Indeed, if any matching of P 
into T can be found, it is also possible to find a matching that only involves valleys in the 
above-mentioned subsequences. Therefore our algorithm will only consider such valleys 
- we call these elements representative. As an example, consider again Figure [TJ Here 
4 6 3 10 is a matching of P ex into T ex where the elements 3 and 10 are not representative. 
This can be seen since 3 is not a valley and 10 is not a valley in the subsequence consisting 
of elements larger than 6. However, this matching can be represented by the matching 
46 2 9 that only involves representative elements (3 is represented by 2; 10 by 9). 

Furthermore, observe that when possible elements for 1,2, ... ,k are successively de- 
termined, we move from left to right in runs up and from right to left in runs down. 
Hence the chosen elements do not only have to be larger than the previously chosen 
element but also have to lie on the correct side of the previously chosen element in the 
same run. These observations are captured in the following definition. 

Definition 3.6. For a permutation ir on [n] and integers i,j < n, we define ^u(i,j) 
(^D<i,j)) as th e subsequence of it consisting of all elements that are right (left) of j and 
larger than i. Then URep(ir,i,j) := Val^jj^^) (resp. DRep(n,i,j) := Val^j^u^)) 
corresponds to the set of representative elements for the case of a run up (resp. down). 

For an example, see Figure [3] where representative elements are shown for the permu- 
tation T ex , i = 3 and j = 2. 

We now describe how the algorithm checks whether there is a matching from P into 
T compatible with the matching function F. The data structure X K consists of run(P)- 
tuples with entries in [0, n]. The i-th component of a tuple in X K is a representative 



8 



D 



x 



x 



X 



Val(T) 

DRep(T, 3, 2) 
U URep(T,3,2) 




Figure 3: Illustrating Definition 13.6 



choice for the largest element < k that lies in the i-th run. Limiting X K to representative 
elements allows to bound its size by 1.262 mn ^. In order to achieve this upper bound, 
we impose the following conditions (CI) and (C2) and remove unnecessary elements by 
applying the rules (Rl) and (R2). In order to state these conditions and rules, we write 
r(«) = i iff k is contained in the i-th run in P. For notational convenience we define 
r(0) := 1. 

First, we set Xq := {(0,0, . . . ,0)}. The set X K is then constructed from X K ^i as 
follows. Let x = (xi, . . . , x mn (p)) G X K —\. We now define N K>X to be the set of all 
v G [n] that satisfy (CI) and (C2). This set contains representative elements to which k 
may be mapped to for the given x. 



(CI) It has to hold that v G URep(F(r(K)),x 



analogously v G DRep(F(r(K)),x 



t(k— 1) ) 



r K \ ) in case k lies in a run up and 



f K \) in case n lies in a run down. 



r(«— 1) ' ■*'r( 

This condition ensures that v is larger than the previously chosen element for k — 1, 
i.e. larger than x r r K _iy Furthermore, it enforces v to lie on the correct side of x r r K \, 
the previously chosen element in this run. Instead of considering all such elements in 
F(r(n)) we only take into account representative elements. 

(C2) If k is not the largest element in its run in P, there has to exist £ G F(r(n)) with 
v < £ and v -<t £, for k appearing in a run up (£ <t v for k appearing in a run 
down) . 

This condition excludes a choice for k that cannot lead to a matching. A non-maximal 
element in a run up (down) in P has to be mapped to an element having larger elements 
to its right (left). We therefore exclude elements in the rightmost (leftmost) run of 
F(r(n)) if this is a run down (up). Condition (C2) is necessary to obtain the runtime 
bounds for the dynamic programming algorithm. 

As an intermediate step let X' K := {x(f) | x G X K ^\ and v G N KjX }, where x(^) := 
yxi, . . . ,x r r K \i,v, 2V(«)+l> " " " > x mn(P))' The tuple x(i/) thus differs from x only at the 
r(«)-th position. Note that two different elements x and x' in X K _i may lead to the 
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representative 
elements for 



= F(r(«)) = F(r(K')), 



= F(v(k - 1)) 



for k and k' in a run down 



for k — 1 in a run 



up 



Figure 4: Illustrating Rule (Rl) 



same element x(V) = x'(z/) in X K if they only differ in x r r K y Rule (Rl) describes how 
to compute X K from X' K . Stating it requires the following definition. 

Definition 3.7. Let it be a permutation of length n. A subsequence of it consisting of 
a consecutive run down and run up (formed like a V) is called a vale. If it starts with a 
run up, this run is also considered as a vale and analogously if it ends with a run down. 
For two elements v\,v% £ M, u i ~ u 2 if both lie in the same val^. For two k-tuples 
x, y € [n] k , x ~ y if for every i £ [k] it holds that Xi ~ yj. For a fixed set of k-tuples S 
and x £ S, the equivalence class [x]^ is defined as all y £ S with x ~ y. 

(Rl) We set X K := {min( r ( K )) ([x]^) | x 6 X^.}, where min( r ( K ))([x]^) picks the tuple in 
[x]^, with the smallest value at the r(n)-th position. If this minimum is not unique, 
it arbitrarily picks one candidate. 

This rule is the key to prove the 1.2611 run ( T ) upper bound on \X K \. It is based on the 
observation that it is enough to keep a single tuple for each [x]^. This means that for a 
set of tuples with coinciding vales it is enough to consider one of them. We provide an 
intuition about the rule and its correctness in the following example. 

Example 3.8. Consider the text permutation schematically represented in Figure HI 
We are searching for representative choices for k, an element lying in a run down. For 
k', the previous element lying in the same run as k, two representative elements are (ii 
(circle) and \i<i (square). They lead to one representative element for k — 1 each: if [i\ 
has been chosen v\ is a representative element (circle) and if ^2 has been chosen vi is 
one. Following condition (CI), we find three representative elements for k in F(r(n)): 
£1 (if v\ has been chosen), £2 and £3 (if i>2 has been chosen). 

We can now observe that it is not necessary to store all three representative elements 
for k. Indeed, in the vale containing £1 and £2 we only need to keep track of £1 since 

Note that every element in a permutation is contained in exactly one vale. 
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this is always a better choice than £2- This can be seen in the following way: In general, 
elements that lie further to the right (left) in a run down (up) might be preferable since 
they leave more possibilities for future elements that are to be matched. Within a vale 
however, the horizontal position does not make any difference, it is only the vertical 
position that matters. Here, the elements left of £2 and right of £1 are not available 
for following choices even if we choose £2 since they are smaller than £2- However, the 
elements left of £1 that are smaller than £2 are only available if we choose £1. H 

In the case that k is the largest element in its run, it is enough to consider a single 
representative element in F(r(n)). This is because the position of the element v is 
no longer relevant since no further elements have to be chosen in this run. Hence the 
following data reduction is performed on X K . 

(R2) Let M KjX := (y r ( K ) | y € X K A (y, = %{ Vi ^ r(/c))}. If k is the largest element in 
its run in P, each x = {x\, . . . , ar r ( re )_i, x r ^, s r («)+i, . . . ,x m n(P)) G X K is replaced 
by the tuple (x x , . . . , x r(K )_i, min(M KjX ), x r(K)+1 , . . . , x mn(P) ) . 

As a consequence there are no two tuples in X K that only differ at the r(/c)-th position 
in this case. 

Termination. For a given matching function F, the algorithm described in Part 2 
terminates as soon as we have reached X^, Observe that Xk is always empty if a 
previous X K was empty. If for any F the data structure X^ is non-empty, P can be 
matched into T. 

Example 3.9. Let us demonstrate with the help of a simple example how the alternating 
run algorithm works. Consider the text T ex and the pattern P ex . In this example we 
consider the matching function F represented in Figure [TJ Figure [S] depicts a successful 
run of the algorithm finding the matching 46 2 9. H 

3.1. Correctness 

Proposition 3.10. P can be matched into T iff X^ is non-empty for some matching 
function F. 

Proof sketch. => If there is a matching of P into T, then there is at least one matching 
function F for which X^ is nonempty: 

The alternating run algorithm described in Section [3] does not find all possible matchings 
of P into T. For instance, in our running example (see Figure [TJ the matching 4 739 
is not found by the algorithm since it involves elements that are not representative. 
Nevertheless, in case a matching of P into T described by a function 93 : [k] — > [n] exists, 
the algorithm detects another matching if it does not find ip itself and thus terminates 
successfully. We proceed in the following way: We first define a matching function F 
and then show that the corresponding Xi, . . . , X^ are non-empty. 

In order to describe F, it is enough to determine the first element of every F(i), 
i G [run(P)]. Clearly, the first element of F(l) is T(l), i.e. the first element in T 
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We start with Xq. 

The only representative element (which is also the only valley) in 
F(r(l)) = F{2) is 2, therefore #1,(0,0,0) = {2}- 

There are 3 representative elements larger than 2 in F(r(2)) = F(l): 
8, 4 and 3. Since 2 is not the largest element in its run in P ex , 
condition (C2) implies that 3 is ruled out. Thus #2,(0,2,0) — {8,4}. 
Rule (Rl) yields X2 = X' 2 - Rule (R2) is not applicable. 

We have #3,(8,2,0) = {H} and #3,(4,2,0) = { 7 > 6 } implying X3 = 
{(11, 2, 0), (7, 2, 0), (6, 2, 0)}. Rule (Rl) discards (11, 2, 0) in favor of 
(7,2,0). Finally, Rule (R2) is applicable here and discards (7,2,0). 

The only representative element larger than 6 in F(r(4)) = F(3) is 
the element 9. The matching of P ex — 2314 into T ex found by the 
algorithm is thus 4629. 



Figure 5: The construction of Xq, . . . , X4 for our running example (T ex ,P ex ). 



- cf. (P3). When determining F(i), let j be the first element in i-th run in P. If 
the i-th run is a run up (down), the first element of F(i) is given by min^{/ : I 
<p(J) and I is an element following a peak (valley)}. This construction guarantees that 
the properties (P1)-(P4) are fulfilled. Constructing F in the described way for the 
matching 4 73 9 yields the matching function presented in Figure [TJ 

Now we show that the described algorithm can indeed find a matching. This follows 
from the fact that the existence of a matching implies for every k £ [k] the existence of 
at least one element that fulfills conditions (CI) and (C2) and that is not removed by 
rule (Rl) or (R2). 

(CI) The existence of <p(n) implies the existence of a representative element, as stated 
in the description of the algorithm, Part 2. 

(C2) Condition (C2) is fulfilled by and its corresponding representative element. 

(Rl) This rule always picks the smallest element in a vale. Since there is a matching 
involving <p(n), there is a matching containing the chosen element for this vale. 

(R2) Again, the chosen element is better than (p(n) since it is smaller or equal (the 
horizontal position does not matter here). 

Thus the existence of (p guarantees that the data structure X K never is empty. 

4= If there is a matching function F such that the corresponding X^ is non-empty, then 

a matching of P into T can be found: 

With the help of X\, . . . ,X}~ we shall construct a matching ip : [k] — > [n] of P into T. 



((0,0,0)) 

((0,2,0)) 



(8,2,0)) ((4,2,0)) 



((6,2,0)) 



((6,2,9)) 
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Start by picking some element x G and setting ip(k) ■= x r ^y Now suppose the 
matching has been determined for k £ [k] to iP(k) := y r t K \ for some y G X K . Then there 
must exist an element y' G X K -\ that has lead to the element y G X K , i.e. y differs from 
y' only at the r(«)-th entry and y r i K ) satisfies the conditions (CI) and (C2) for y'. We 
define ip(n — 1) := y' r ^ K _iy This defines the function ip : [k] — > [n]. We now prove that tp 
is a matching. First observe that is a monotonically increasing map because of (CI). 
Second the sequence ip(P) is a subsequence of T. This is because of (CI) (correct order 
of elements in P within a run), (C2) and the concept of matching functions (correct 
order of elements in P in different runs). The rules (Rl) and (R2) are only required for 
improving the runtime. □ 



3.2. Runtime 

Remark 3.11. In order to simplify the analysis of the algorithm we assume that if P 
ends with a run up (down) then T ends with a run down (up). This can be achieved by 
adding an element (n + 1) at the end of T. There is a matching of P into T iff P can 
be matched into the modified T. Furthermore, this element will never be chosen by the 
algorithm and therefore does not influence it. The only difference is that run(T) increases 
by 1. This, however, does not show in the O notation of the algorithm's runtime. 

We first want to bound the size of X K for every k G [0,k] and for a given matching 
function F. For the following definition recall the definition of vales (Definition 13. 7|) . 

Definition 3.12. Fori G [run(P)] and k G [k] let v l K denote the number of distinct vales 
the elements in {xi | x G X' K } are contained in. 

Lemma 3.13. Let F be a matching function, i G [run(P)] and k G [k]. If k is not the 

largest element in its run in P then 

i run(F(i)) 
2 



Proof. This is shown by induction over k. Since Xq = {(0, . . . , 0)} the statement clearly 
holds for v\ with i ^ r(l). For -u^ 1 ^ six different cases have to be considered. Since this 
step of the proof is the same for all v «^ , we will show it for any n£ [k]. These six cases 
are depicted in Figure [6j Continuous lines represent runs in F(t(k)) that may contain 
elements in {x r ( K ) | x G X' K }. Dashed lines represent runs consisting of elements that 
are not considered because of condition (C2). Recall that (C2) stated that if k was not 
the largest element in its run in P, only elements having larger elements to their right 
(left) were allowed to be considered for k in a run up (down). In all cases represented 
in Figure [6] the number of considered vales is bounded by run (^M) _ 

It remains to show the induction step. We already know that this step holds for i = 
r{n) (see Figure[6]). For all other i G [run(P)] we only have to observe that {xi \ x G X' K } 
is a subset of {x^ \ x G This is because the rules (Rl) and (R2) remove elements 
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Possible if k is 
in the first run in P 


Possible for arbitrary k 


Possible if k is 
in the last run in P 


& 
a 

Sh 

03 
_g 

a] 


V.... V\ 

At most ™W«)))-1 
vales are considered. 


N .... V\ 

At most 

vales are considered. 


N .... V 

Not necessary to consider 
because of Remark 13.111 


k is in a run down 


/V .... V 

At most 

vales are considered. 


V .... V 

At most run (^M» 
vales are considered. 


V .... w 

Not necessary to consider 
because of Remark |3.11| 



Figure 6: All six different cases that have to be considered when counting the number 
of vales in F(t(k)) for some k G [k]. 

from X' K _ 1 and in the step from A K _i to X' K only the r(K)-th component of elements in 
X K _i is modified. □ 

Lemma 3.14. For any given matching function F and every k G [k] 

run ( p ) ^CV'W 

< n 

i=i 

Proof. We prove this statement by induction over k G [0, k]. First, it is true for k = 
since |Ao| = 1. Now, how many elements may be added in a step from A K _i to X K 1 We 
have to consider two cases: 

1. k is the largest element in its run in P: 

In this case we apply the Rule (R2) as described in Section [3l As stated in Rule 
(R2) it is enough to consider a single representative element in i ? (r(Ac)) for each 
element in A" K _i. Thus every element in A" K _i leads to at most one element in X K 
and we have by the induction hypothesis: 

< iav.i < n '-^P 1 ^ 

i=i 

2. k is not the largest element in its run in P : 

In this case it is Rule (Rl) that guarantees the size bound for X K . From the 
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description of (Rl) it follows that 

\x K \ = \{[^\xex' K }\, 

since a minimal element is determined for every equivalence class [x]^. Irrespective 
of how many elements are contained in X' K , we can give an upper bound for the 
total number of equivalence classes [x]^. By Definition 13.71 and 13.121 we know that 

run(P) 

|{[x]„ \xeX' K }\< n <■ 

i=l 

Then Lemma 13.131 implies that 

run ( p ) 

w=i(M~ix«ai< n 



We have thus proven the desired upper bound for \X K \ for every k £ [k]. □ 

Proposition 3.15. The runtime of the alternating run algorithm is 0*(1.784 run ( T )) and 
thus also O* (1.784"). 



Proof. We know from Lemma 13.141 that 

run(P) 



x.i < n , - } ™ ID 



i=l 

for every k£ [k]. Observe that 

run(P) 

run(F(i)) = run(T) + run(P) - 1, (2) 

i=i 

since two subsequent .F(i)'s have one run in common (see Figure [2]). The inequality of 
geometric and arithmetic means implies that this product is maximal if all run(i ? (i)) are 
equal, i.e. for every i £ run(i- > ) 

= n.n(D + n.n(f)-l 
v v run(P) 

Therefore X K has at most ^ — - 1 ^ elements for every n E [k]. To find 

an upper bound for this function requires extensive calculations which we detail in 
Appendix^ It follows that \X K \ < 1.2611 run ( T ). 

Given an element in x £ X K ^\ computing N K x clearly requires only polynomial time. 
In total, computing X' K takes 0*(X K ^) = O* (1.261 l mn ( T )) time. The size of X' K is then 
at most |X K _i| • run(i ? (r(«))), since the conditions (CI) and (C2) do not hold for more 
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than rur\(F(r(n))) many elements per element in X K —\. Rule (Rl) can be executed by 
sorting X' K and hence requires 0*(\X' K \ ■ \og(\X' R \)) = C*(1.2611 mn(T) ) time. The same 
holds for Rule (R2). 

By Lemma 13.51 there are at most \/2 mn T matching functions F we have to consider. 
For each of these, Xq, . . . , X^ has to be computed. Thus the runtime of the alternating 
run algorithm is in 0*(V2 ru " {T) • 1.2611 run ( T >) = C*(1.784 mn ( T )). Moreover it always 
holds that run(T) < n and thus the runtime of the alternating run algorithm is also 
0*(1.784 n ) □ 



3.3. The parameter run(P) 

We present two XP results for PPM with respect to the parameter run(P). The first 
result follows from the alternating algorithm when analyzing its runtime with respect to 
run(P). The second result follows from an algorithm in pQ and a lemma shown in this 
paper. 

Proposition 3.16. The runtime of the alternating run algorithm is 

o \ mn(P)\ 



O* 



n 



2run(P) 



Proof. In the proof of Proposition 13.151 we have seen that X R has at most (run(T) + 
run(P) — l)/(2run(P)) mn ( p ) elements. We can of course assume that run(P) < n, 
since run(P) > n would imply that P has more elements than T and then there 
clearly is no matching of P into T. At the same time run(T) < n always holds. 
Thus \X K \ < (2n/(2run(P))) mn(p) for all k G [k]. Again, as in the proof of Propo- 
sition [3J2] constructing X K from X K _i can be done in time 0*(\X K _i\), i.e. in time 
0*((2n/(2run(P))) mn(p) ) and has to be done |P| < n times. 

Moreover we saw in Lemma 13.51 that the number of matching functions F is at most 



n\ run(P) 

run(P) -lj " vr "" v ^ " \2, 



For each of these P's, Xo, . . . , X^ has to be computed. Thus the runtime of the alternat- 
ing run algorithm is 0*((n/2) mn ( p ) • (n/(run(P))) mn(p) ) proving the desired XP runtime 
bound. □ 

Before we can state the theorem proven in pQ which we will build upon, we need the 
following definitions. 

Definition 3.17. Given a pattern tt of length m, the graph G n = (V,E) is defined as 
follows: The vertices V := [to] represent positions in tt. There are edges between adjacent 
positions, i.e. E\ := {{i,i + 1} | i € \m — 1]}. There are also edges between positions 
where the corresponding values have a difference ofl, i.e. E2 '■= {{i,j} \ 7r(i) — 7r(j) = l}. 
The edge set is defined as E := E\ U E%. 
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Figure 7: To the left a graphical representation of the permutation tt introduced in Ex- 
ample 13.181 to the right the corresponding graph G n 

Example 3.18. Consider the permutation 

_/l23456789\ 
7r ~V25974683lJ 

written in two-line representation. A graphical representation of tt can be found on 
the left-hand side of Figure 13.181 The corresponding graph G^ is represented on the 
right-hand side of the same figure. The solid lines correspond to the edges in E\ and 
the dashed lines to the ones in E%. H 

Definition 3.19. Let G = (V,E) be a simple graph, i.e. E consists of subsets of V of 
cardinality 2. A path decomposition of G is a sequence X\, . . . , Xk of subsets of V such 
that 

1. Every vertex appears in at least one X{, i G [k]. 

2. Every edge is a subset of at least one X{, i E [k]. 

3. Let three indices l<h<i<j<k be given. If x G X% and x G Xj then 

The width of a path decomposition is defined as max{|Xi|, . . . , \Xk\} — 1. The pathwidth 
of a graph G, written pw(Gr), is the minimum width of any path decomposition. 

Theorem 3.20 (Theorem 2.7 and Proposition 3.5 in [1]). PPM can be solved in time 
0(n l+pw ( Gp ^). 

Lemma 3.21. pw(G 7r ) < run(7r) holds for all permutations tt. 
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Proof. Given a permutation tt of length m we will define a sequence X\, . . . ,X m . We 
then show that this sequence is a path decomposition of G n = (V, E) with width at most 
run(7r). In this proof we use the variables i,j for positions in tt and the variables u,v,w 
for values of tt, i.e. 7r(l),7r(2), etc. 

In order to define the sequence X%, . . . , X m of subsets of V, we shall extend alternating 
runs to maximal monotone subsequences. This means that we add the preceding valley 
to a run up and the preceding peak to a run down. For any s G [run(7r)], R s then denotes 
the set of elements in the s-th run in tt together with the preceding valley or peak. Note 
that this implies that \R S n i? s +i| = 1 for all s G [run(7r) — 1]. 

We define X[ := {1} and 

X' v := { max(i? j D [v - 1]) | j G [run(vr)] and Rj n [v - 1] ^ 0} U {v} 

for every v G {2, . . . ,m}. Since X v should contain positions (and not elements) in tt we 
define 

X v := {7r- x H \weX' v }. 
We now check that X\, . . . , X m indeed is a path decomposition. 

1. The vertex i appears in X n u\. 

2. First we consider edges of the form + 1}. We distinguish two cases. 

• 1. case: Tr(i) < tt{i + 1). 

Then {i, i + 1} is a subset of X n r i+ i\. Clearly, i + 1 G X w r i+ xy Since ir(i) 
and Tr(i + 1) are adjacent in tt there has to be a s G [run(7r)] such {vr(i), n(i + 
1)} C R s . It then holds that max(i? s Pi + 1) — 1]) = 7r(z) since 7r(i) G 
R s Pi [7r(z + 1) — 1] and 7r(«) is the largest element in R s smaller than ir(i + 1). 
Consequently i G X T ( i+ iy 

• 2. case: ir(i + 1) < vr(i). 

Analogously one sees that {i, i + 1} is a subset of X n u\. 

Every edge {i,j} G E with ir(i) — = 1 is a subset of X^uy. As before i G X % uy 
Let s be any element of [run(-7r)] such that j G R s . Then max(i? s n [vr(i) — 1]) = 
max(R s n [vr(j)]) = 7r(j) and hence j G ^(j). 

3. Let l<u<v<w<m with i G X u and i G X w . Let s be any element of [run(-7r)] 
such that i G R s . Then either Tr(i) G R s Pi [u — 1] or 7r(«) = u. In both cases is 

G i2 s n[u]. Furthermore, 7r(i) = max(i? s n[w]) = max(i? s n [v]). Hence i G X„. 

The cardinality of each Xi is at most run(7r) + 1 and hence pw(G 7r ) < run(-7r). □ 

Remark 3.22. This bound is tight since G n for 7r = 123...misa path and hence has 
pathwidth 1. 

Example 3.23. Consider again tt as defined in Example 13.181 The elements of the sets 
X[ , . . . , Xg and those of X\ , . . . , Xg as defined in the proof of Lemma 13.211 are given 
in Figure 13.231 It is easy to check that X%, ... ,Xg indeed is a path decomposition of 
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Figure 8: The sets X[ , . . . , Xg and X\ , . . . , Xg for the permutation ir = 259746831 

width 4 = run(-7r). Note that in the given table, columns of equal numbers do not 
contain any gaps. This fact corresponds to the third condition in the definition of path 
decompositions. H 

Corollary 3.24. PPM can be solved in time 0(n 1+run ^). 

4. W[l]-hardness for the parameter run(P) 

The following hardness result shows that we cannot hope to substantially improve the 
XP results for PPM parameterized by run(P) (Proposition 13.161 and Corollary I3.24p . 
Indeed, an fpt algorithm with respect to run(P) is only possible if FPT = W[l]. 

Theorem 4.1. PPM is \N[l]-hard with respect to the parameter run(P). 



Proof. The following proof is inspired by the proof given in [6] for the N P-completeness 
of PPM. We show W[l]-hardness by giving an fpt-reduction from the following problem 
to PPM: 



Clique 




Instance: 


A graph G = (V, E) and a positive integer k. 


Parameter: 


k 


Question: 


Is there a subset of vertices S C V of size k such that S forms 
a clique, i.e. the induced subgraph G[S] is complete? 



Let (G,k) be a Clique instance, where V = {v±, V2, ■ ■ ■ , v{\ is the set of vertices 
and E = {ei, e2, . . . , e m } the set of edges. We are going to construct a PPM instance 
(P, T). We shall first construct a pair (P',T'). P' and T' are both permutations on 
multisets, i.e. permutations in which elements may occur more than once. Applying 
Definition [2J] to permutations on multisets means that in a matching repeated elements 
in the pattern have to be mapped to repeated elements in the text. Afterwards we deal 
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with the repeated elements in order to create a pattern and a permutation on ordinary 
sets and hereby obtain (P,T). 

Both the pattern and the text consist of a single substring coding vertices (P resp. 
T) and substrings coding edges (Pj resp. Tj for the i-th substring). These substrings 
are listed one after the other, with guard elements placed in between them to ensure 
that substrings have to be matched to corresponding substrings. For the moment, we 
will simply write brackets for the guard elements, indicating that a block of elements 
enclosed by a [to the left and a ] to the right has to be matched into another block of 
elements between two such brackets. In addition, a guard block Gp is placed at the end 
of P. The construction of these guards shall be described later on. 

We define the pattern to be 

P' := [P] \P 1 ][P 2 -]\...-]\P k{k _ 1)/2 ]G P 

= [123 ...k] [12] [13] [. . .] \lk] [23] [. . .] \2k] [. . .] [(Jfe - l)k] G P . 

P corresponds to a list of (indices of) k vertices. The Pi's represent all possible edges 
between the k vertices (in lexicographic order). 
For the text 

T':= [T] [fx] [T 2 ] [...] \f m ]G T 

we proceed similarly. T is a list of the (indices of the) I vertices of G. The Tj's represent 
all edges in G (again in lexicographic order). Let us give an example: 

Example. Let I = 6 and k = 3. Then the pattern permutation is given by 

P' = [123] [12] [13] [23] G P . 

Consider for instance the graph G with six vertices vx, . . . , vq and edge-set 

{{1, 2} , {1, 6} , {2, 3} , {2, 4} , {2, 5} , {3, 5} , {4, 5} , {4, 6}} . 

represented in Figure [9] (we write {i,j} instead of {vi,Vj}). 
Then the text permutation is given by: 

T' = [123456] [12] [16] [23] [24] [25] [35] [45] [46] Gt ■ 

H 

Claim 1. A clique of size k can be found in G if and only if there is a simultaneous 
matching of P into T and of every Pi into some Tj. 



Example (continuation). In our example {^2,^3,^5} is a clique of size three. 
Indeed, the pattern P' can be matched into T' as can be seen by matching the 
elements 1,2 and 3 onto 2,3 and 5 respectively. See again Figure [9] where the 
involved vertices respectively elements of the text permutation have been marked 
in gray. H 
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Figure 9: An example for the reduction of an Independent Set instance to a PPM 
instance. 

Proof of Claim{l\ A matching of P into T corresponds to a selection of k vertices 
amongst the I vertices of G. If it is possible to additionally match every one of the 
.P's into a T this means that all possible edges between the selected vertices appear in 
G. This is because T' only contains pairs of indices that correspond to edges appearing 
in the graph. The selected k vertices thus form a clique in G. Conversely, if for every 
possible matching of P into T defined by a monotone map (p : [k] — > [I] some P\ = xy 
cannot be matched into T", this means that {cp(x), <p(y)} does not appear as an edge in 
G. Thus, for every selection of k vertices there will always be at least one pair of vertices 
that are not connected by an edge and therefore there is no clique of size k in G. □ 

In order to get rid of repeated elements, we identify every variable with a real interval: 
1 corresponds to the interval [1, 1.9], 2 to [2, 2.9] and so on until finally k corresponds to 
[k, k + 0.9] (resp. I to [I, I + 0.9]). In P and T we shall therefore replace every element j 
by the pair of elements (j + 0.9, j) (in this order). The occurrences of j in the Pj's (resp. 
Tj's) shall then successively be replaced by real numbers in the interval [j,j + 0.9]. For 
every j, these values are chosen one after the other (from left to right), always picking a 
real number that is larger than all the previously chosen ones in the interval [j,j + 0.9]. 

Observe the following: The obtained sequence is not a permutation in the classical 
sense since it consists of real numbers. However, by replacing the smallest number by 1, 
the second smallest by 2 and so on, we do obtain an ordinary permutation. This defines 
P and T (except for the guard elements). 

Example (continuation). Getting rid of repetitions in the pattern of the above 
example could for instance be done in the following way: 

P = [1.9 1 2.9 2 3.9 3] [1.1 2.1] [1.2 3.1] [2.2 3.2]G P 
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This permutation of real numbers is order-isomorphic to the following ordinary 
permutation: 

P= [4185129] [26] [310] \7U]G P . 

H 

Claim 2. P can be matched into T iff P' can be matched into T' . 

Proof of Claim® Suppose that P' can be matched into T' . When matching P into T, 
we have to make sure that elements in P that were copies of some repeated element in P' 
may still be mapped to elements in T that were copies themselves in T" . Indeed this is 
possible since we have chosen the real numbers replacing repeated elements in increasing 
order. If i in P' was matched to j in T', then the pair (z + 0.9, i) in P may be matched to 
the pair (j + 0.9,j) in T and the increasing sequence of elements in the interval [i, z + 0.9] 
may be matched into the increasing sequence of elements in the interval [j,j + 0.9]. 

Now suppose that P can be matched into T. In order to prove that this implies that 
P' can be matched into T', we merely need to show that elements in P that were copies 
of some repeated element in P' have to be mapped to elements in T that were copies 
themselves in T'. Then returning to repeated elements clearly preserves the matching. 
Firstly, it is clear that a pair of consecutive elements i + 0.9 and i in P has to be 
matched to some pair of consecutive elements j + 0.9 and j in T, since j is the only 
element smaller than j + 0.9 and appearing to its right. Thus intervals are matched to 
intervals. Secondly, an element x in P for which it holds that i < x < i + 0.9 must be 
matched to an element y in T for which it holds that j < y < j + 0.9. Thus copies of an 
element are still matched to copies of some other element. 

Finally, replacing real numbers by integers does not change the permutations in any 
relevant way. □ 

It remains to implement the guards in order to ensure that substrings are matched to 
corresponding substrings. 

At the end of P we place a guard block Gp consisting of a decreasing sequence of 
elements that are larger than all those occurring in P and that is at least as long 
as the permutation T (considered with guard elements). In P we have so far used 
2k + k(k - 1) = k + k 2 integers and T consists of at most 2 + 21 + 21(1 - 1) = 2 + 
2l 2 integers since there are at most /(/ — l)/2 edges occurring in G and every edge 
i,j in encoded by the four elements Therefore Gp is the decreasing sequence 

of 2 + 2l 2 values 2 + 2l 2 + k + k 2 , . . . , 2 + k + k 2 , 1 + k + k 2 . As guard elements we 
use k(k — 1) + 2 integers larger than the largest element previously used. The largest 
guard element is thus equal to 4 + 2l 2 + 2k 2 and shall be denoted by P max - Then 
we insert the guards in the designated positions (previously marked by [ and ]) in 
the following order: P max — 1 (instead of the first [),-P m ax (instead of the first ]),-P m ax — 
3 (instead of the second [),P m ax — 2 (instead of the second ]),... , P max — 2i—l (instead 
of the (i + l)-th \),P max — 2i (instead of the (i + l)-th ]),..., and so on until we reach 
the last guard-position. The guard elements are inserted in this specific order to ensure 
that two neighboring guard elements [ and ] in P have to be mapped to two neighboring 
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guard elements [ and ] in T. We proceed similarly in T. Gt is the decreasing sequence of 
2 + 2l 2 values larger than the largest element of T. The sequence T max — 1, T max , T max — 
3,T max — 2, . . . where T max has been chosen accordingly is inserted in the guard-positions. 

Claim 3. Implementing the guards in the described way guarantees that in a matching 
of P into T , P is matched into T and each P is matched into a T. 

Proof of Claim\^ The two decreasing subsequences Gp and Gt added at the end of P 
and T indeed serve as guards. When the last element of P is matched into T, there are 
two possibilities. The first possibility is to map the last element either onto one of the 
large elements in Gt or to one of the guard elements in T. Then all larger elements 
in P have to be mapped to larger elements in T which means that all guard elements 
(including the decreasing subsequence in Gp) in P have to be mapped to guard elements 
in T. A pair (i, i + of guard elements in P must be mapped to a pair (j, j + 1) of guard 
elements in T since (j + 1) is the only element larger than j and standing to its right. 
Consecutive guards in the pattern must be mapped to consecutive guards in the text, 
this implies that the elements in between these pattern guards must also be mapped 
to the elements in between the corresponding text guards. This is what we wanted: 
substrings have to be mapped to corresponding substrings. 

The other possibility is to map the last element of P to some element x that is not a 
guard element in T. But then the remaining elements of P that are larger than the last 
one - these necessarily are more than 2 + 2l 2 since this is the length of Gp - have to be 
matched into less than 2 + 2l 2 elements in T since there are at most 1 + 2l 2 elements 
lying to the left of x. This is not possible. □ 

This finally yields that (G, k) is a YES-instance of Clique if and only if (P, T) is 
a YES-instance of PPM. It remains to show that this reduction can be done in fpt- 
time. When counting the alternating runs in P we may consider two parts: the main 
part Mp of P consisting of all the non-guard elements as well as the guards placed in 
between them and the guard block Gp consisting of the decreasing subsequence of length 
2 + 2l 2 . The length of M P is equal to k + k 2 + 2 + k(k - 1) = 2 + 2k 2 and thus clearly 
run(Mp) < 2 + 2k 2 = 0{k 2 ). Since Gp consists of a single run down, it follows that 
run(P) = run(Afp) + 1 = 0{k 2 ). Moreover the length of T is bounded by a polynomial 
in the size of G since \T\ < 4(1 + I 2 ) = 0{l 2 ) = 0{\G\ 2 ). □ 

Remark 4.2. Since in the fpt-reduction given in the proof of Theorem 14.11 the length of 
the pattern can be bounded by a polynomial in the size of G, this is also a polynomial 
time reduction. Therefore the proof of Theorem 14.11 can also be seen as an alternative 
way of showing NP-completeness for PPM. 

5. Future work 

Theorem 13.11 shows fixed-parameter tractability of PPM with respect to run(T). An 
immediate consequence is that any PPM instance can be reduced by polynomial time 
preprocessing to an equivalent instance - a kernel - of size depending solely on run(T). 
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This raises the question whether even a polynomial-sized kernel exists. Another research 
direction is the study of further parameters such as permutation statistics listed in the 
Appendix A of |14j . The major open problem in this regard is whether PPM is fpt with 
respect to the length of P. Finally, our method of making use of alternating runs might 
lead to fast algorithms for other permutation based problems. 
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A. Appendix: Calculations leading to size bounds for the data 
structure X K 

This appendix details the calculations in the proof of Theorem 13.151 We know that 
1**1 < UZi iP) '-^T^ from LemmaEHand 

run(P) 

run(F(i)) = run(T) + run(P) - 1 (3) 

i=i 

from the proof of Theorem 13. 151 In the following we write r := run(P) and t := run(T). 
From the inequaltiy of geometric and arithmetic means it follows that 

A run(F(i)) < f t + r-1 

J-J- 2 - I 2-r 

i=i v 

and equality in the above inequation only holds when wn(F(i)) is equal for all i G [r]. 
Thus, n[=i mn ^^ is maximal for 

t + r — 1 

run(F(i)) = for all i G [r]. 

r 

Now we want to determine the maximum of the function 



g{r) 

that is an upper bound for \X K \. 
r \ V r J 



t + r-1 
27 



r 



((r + i - 1) log (— _ ) -rlog(2)-i(l + log(2)) + l + log(2))J =0 

(r + 1 - 1) flog ( r + *~ 1 J - log(2)) - t + 1 = 



log 

The solutions are: 



r + t- 1\ i- 1 



2r / r + i - 1 



ri ( t ) = (_i + t )/(_i + 2e 1 + w/ o(-i/(2e)) ) 

r2 (t) = (_l + t)/(_l + 2e 1 +^(- 1 /(2e)) )) 

where Wq is the principal and W-\ is the lower branch of the Lambert function defined 
by the equation x = W(x) ■ e w<yX \ It holds that 

(-1 + t)/3.311071 < n{t) < (-l + t)/3.311070 
(-1 + t)/- 0.62663 < r 2 (t) < (-1 + t)/- 0.62664, 
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The second solution ^(i) is negative and therefore of no interest to us. 

, , /t + (-l + t)/3.311070 -l^ ( " 1+t)/3 - 311070 

g(n) < 



2(-l + i)/3.311071 
< 0.80 • ( 2.155535 '* 



< 0.80 • (1.261071)*. 

It therefore holds that 



\XJ < 1.2611 run(T) . 



Combining this with the fact that there are at most y2 matching functions (as 

shown in Lemma 13.51) leads to the runtime: 



C>((1.2611 • V2) run ^ • poly(n)) < O* (l.78A run ^). 
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